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Abstract—This paper is intended to investigate how the Dai 
Dialect speaker in Dehong Autonomous Prefecture perceives 
Mandarin tones. Therefore identification and discrimination 
experiments are conducted of the four Mandarin tone 
continuums with Dai Language speakers. The results suggested 
that the Dai Language speaker’s perception mode of T1-T2 and 
T2-T4 continuums are Categorical Perception, but for T1-T3, T2- 
T3, T3-T4 and T1-T4 continuums are not. These findings indicate 
that Dai language tone system does have a significant influence 
on the perception of Mandarin tones. 
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I. INTRODUCTION 


Categorical perception (CP) has been one of the most 
extensively studied phenomena in speech perception for nearly 
50 years because it is believed to reflect some fundamental 
aspects of the processing of speech sounds [1]. Most of the 
Earlier CP researches concentrated on segmental features of 
speech, i.e., consonants and vowels. The consonants were 
considered perceived categorically which means there is a clear 
linguistic boundary in the continuum between two consonants, 
whereas vowels are not [2, 3, 4]. Since 1970s, more and more 
attention has been put to the suprasegmental features of speech, 
especially the tone. Wang [S]and Francis et al.[6,7] proved that 
a continuum ranging from a high level tone to a high rising 
contour tone is perceived categorically by native speakers 
Abramson and Francis et al. [6,7] pointed out that a continuum 
ranging from one or more level tone to another is not perceived 
categorically by native speakers. 


Mandarin Chinese has four lexical tones and can be 
described phonetically as T1-high level, T2-high rising, T3- 
falling rising, and T4-high falling, respectively. Though not 
without controversy, the consensus opinion has been that the 
continuums between T1-T3, T1-T3 and T1-T4 are perceived 
categorically whereas the continuum ranging from T2-T3 is not 
[10, 11, 12]. In addition, the linguistic experience has a crucial 
influence on the development of CP. Wang [5] first 
demonstrated that there is no linguistic boundary for native 
American English (non-tone language) subjects. Since then, 
several studies of Mandarin tone perception have investigated 


the impact of language background, but most focusing on the 
contrast between native tone language and non-tone language 
subjects. However, different inventories of tones may also 
influence tone perception of another language. As we all know, 
there are so many tone languages in china, the tone numbers of 
some minor languages are even more than Mandarin. Therefore, 
how will these tone-language speakers perceive Mandarin? 

Will they establish the similar CP as native Mandarin speakers? 
There is still a lack of studies on whether and how different 

tone systems affect tone perception in terms of CP. 


In this study, we choose the Dai language speakers in 
Dehong Autonomous Prefecture as subjects to examine their 
perception of Mandarin tones and the influence of their 
mother-tongue. Dai language which is used by Dehong branch 
of Dai ethnic group is part of Sino-Tibetan language family. 
According to previous studies [13, 14, 15], Dai language in 
Dehong has nine tones for monosyllabic words: three level 
tones, four falling tones and two rising tone. Their tone values 
are: 24\35\5 I\54\3 1\44\33\55\41. This paper therefore 
undertook a perceptual experiment of the four Mandarin tone 
continuums with Dai Language speakers. 


II. EXPERIMENT 


A. Subjects 


Thirty native speakers of Dai language in Dehong (15males; 
15 females) participated in this experiment. They are similar in 
age and years of formal education. All participants were from 
Jiedong village, Ruili County, Dehong Autonomous Prefecture 
and can also speak Mandarin. All participants exhibited normal 
hearing sensitivity and were paid for their participation. 


B. Stimuli 


The stimuli were derived from the Mandarin syllable /da/ 
with four Mandarin tones(T1: dal, “take”; 
T2:da2,“reach”;T3:da3,“hit”;T4:da4,“big’’) and was 
pronounced by a female native speaker. The four naturally 
produced /da/ syllables were normalized to a duration of 500 
ms. Then 12 pairs of a tone continuum were constructed 
manipulating FO from each of the four /da/ syllables to the 
other ones with the Praat software (Boersma and Weenink 


2009). Take T1-T2 as an example: we first get 11 pitch points 
of FO to plot the pitch contour of “dal” and “da2”, and then 
compute 9 pitch contours between the two syllables’ original 
FO by dragging the 11 pitch points equidistantly. As the 
phonation cues also have an influence in CP perception [10, 
16], we use “dal” and “da2” as the original sample 
respectively. That means there are two continuum subgroups 
with the same change of FO parameters but different phonation 
information of the T1-T2 group. Finally, 11 steps of stimuli in 
each of the 12 pairs of tone continuum were synthesized to new 
sound files in Praat. Figure 1 showed the schematic diagram of 
FO manipulation of the 11 stimuli for the pairs of tone 
continuum. 
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Fig. 1. Schematic diagram of 11-step FO manipulation for the pairs of tone 
continuum. 


C. Procedure 


1) Identification task 

In the identification task, participants listened to stimuli 
from the 12 continuum in random order. The 11 stimuli of each 
continuum were repeated 2 times in the whole identification 
session, forming a total of 132 trials. During each trial, there 
will be two Chinese characters on the screen of the computer, 
the subjects were instructed to press the left or right mouse 
button to choose which one is the character they heard. Once a 
response was collected, the next stimulus was presented 
automatically. Participants were allowed to press a “pause” 
button to take a self-controlled short break. 


2) Discrimination task 

In the discrimination task, stimuli were presented in pairs 
with a 500-ms inter stimulus interval. A total of 324 pairs were 
presented in random order. Of these pairs, 216 consisted of two 
different stimuli separated by two steps of the continuum, in 


either forward (1-3, 2-4 ...9-11) or reverse order ( 11-9, 10- 
8...3-1). The remaining 108 pairs contained one of the middle 
nine stimuli of the continuum paired with itself (same pairs: 2- 
2, 3-3...9-9). There were 2 occurrences per stimulus pair. After 
hearing each pair, participants were instructed to judge whether 
the two stimuli were the same or different, and to respond by 
pressing a mouse button left= “same,” right= “different”. 
Participants were also allowed to press a “pause” button to take 
a self-controlled short break. 


D. Data analysis 


CP has the following characteristics [17]: In the labeling 
function, there is a sharp boundary between two categories; in 
the discrimination function, accuracy peaks at the category 
boundary, but is at or near chance level within category; the 
discrimination function can be predicted from the identification 
function. However, the third characteristic is controversial, so 
we obtained individual results for each subject based on the 
first two essential characteristics of CP: category boundary 
position and width ; corresponding discrimination peak. 


3) Identification scores 

The identification score was defined as the percentage of 
responses with which participants identified that stimulus as 
being either ‘Sound1’ or ‘Sound2. Then the boundary position 
and boundary sharpness were measured by applying logistic 
regression analysis as Xu, Gandour, and Francis [1]. The 
boundary position was defined as the 50% crossover points, 
and the boundary width was defined as the linear distance 
between the 25% and 75% crossover points as determined by 
the mean and standard deviation obtained from probit analysis. 


4) Discrimination scores 

In the discrimination task, stimuli were presented in pairs 
with a 500-ms inter stimulus interval. A total of 324 pairs were 
presented in random order. Of these pairs, 216 consisted of two 
different stimuli separated by two steps of the continuum, in 
either forward (1-3, 2-4 ...9-11) or reverse order ( 11-9, 10- 
8...3-1). The remaining 108 pairs contained one of the middle 
nine stimuli of the continuum paired with itself (same pairs: 2- 
2, 3-3...9-9). There were 2 occurrences per stimulus pair. After 
hearing each pair, participants were instructed to judge whether 
the two stimuli were the same or different, and to respond by 
pressing a mouse button left= “same,” right= “different”. 
Participants were also allowed to press a “pause” button to take 
a self-controlled short break. 


HI. RESULTS AND DISCUSSION 


The category boundary position and width obtained by 
probit analysis of the 12 groups are presented in Table 1. These 
parameters and the discrimination scores were used to plot the 
identification and discrimination curves in Fig. 1. Results of 
Repeated Measure ANOVA showed that continuum subgroups 
based on the same change of FO parameters but different 
phonation information in T2-T3 and T3-T4 groups were 
perceived significant differently [F (1,654) = 9.124, p<0.001; F 
(1,654) = 24.952, p<0.001], whereas the phonation effect in 
other groups were not significant (p>0.001), which means the 
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phonation information only contributed to the perception of 
Mandarin Tone 3 in this study. Therefore, we merged the data 
of each two subgroup with the same change of FO parameters 
but different phonation information together. 


TABLE 1. DERIVED CATEGORICAL BOUNDARY POSITION AND WIDTH 


Group | Boundary | Width 
T1-T2 4.57 2.59 
T1-T3 5.37 1.98 
T1-T4 4.28 2.19 
T2-T3 6.70 2.71 
T2-T4 6.12 1.44 
T3-T4 5.86 1.73 
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Fig. 2. Logistic identification (solid lines) and two-step discrimination 
curves (dashed lines) “obtained” from the discrimination and identification 
score. 


For the identification results, the category boundary is clear 
and sharp for all the continuums as we can see from the Table 
1 and Fig. 1. The boundary width of T2-T4 is the narrowest 
and T2-T3 is the widest. 


For the discrimination results, one-way ANOVA revealed 
that the discrimination scores were significantly different 
among the 9 stimulus pairs of T1-T2, T2-T4 and T1-T4 
continuums [F (8,261) = 10.299, p<0.001; F (1,654) = 13.093, 
p<0.001; F(8,261)= 18.438,p<0.001], but not significantly 
different in T1-T3, T2-T3, T3-T4 continuums [F (8,261) = 
2.556, p>0.001; F (1,654) = 1.597, p>0.001; F(8,261)= 3.636, 
p>0.001].That means, discrimination peaks can be found 
clearly in the perception results of T1-T2, T2-T4 and T1-T4 
continuums whereas there were no discrimination peaks in the 
perception results of T1-T3, T2-T3 and T3-T4 continuums. In 
addition, the discrimination peak of T1-T4 continuum was not 
corresponding to its category boundary according to Fig.1. 


To sum up, steep categorical boundaries and corresponding 
discrimination peaks were only found in the perception results 
of T1-T2 and T2-T4 continuums, which indicating a typical CP 
for Dai speakers in Mandarin T1-T2 and T2-T4. Nevertheless, 
CP was not successfully established in T1-T3, T2-T3 and T3- 
T4, which may due to the lack of similar contour tone to 
Mandarin Tone 3 in Dai language in Dehong. Furthermore, Dai 
speakers also cannot perceive T1-T4 categorically may on 
account of there are much more falling tones and level tones in 
Dai language in Dehong than in Mandarin, which may make 
the subjects more sensitive to the change from level to falling 
in Mandarin tones. 


IV. CONCLUSION 


In this study, we have examined the influence of Dai 
language experience on the perception of Mandarin tones in the 
framework of CP. From the identification and discrimination 
experiments, we see clearly that the Dai speakers in Dehong 


perceived the contrast of level vs. rising and level vs. falling 
tones in Mandarin categorically, while they didn’t establish 
categorical perception for the contrast of level vs. falling, and 
falling-rising vs. other tones. These results were shaped and 
contributed to the different tone inventories of Dai language in 
Dehong. 
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